Smart Paper Notes

Supplementing Recurrent Neural Network Wave Functions with Symmetry and Annealing to Improve Accuracy

Mohamed Hibat-Allah , Roger G. Melko , Juan Carrasquilla

Category: Machine Learning

2022-07-28

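Recurrent neural networks (RNNs) are a class of neural networks that emerged from the paradigm of artificial intelligence and have enabled many interesting advances in natural language processing. Interestingly, these architectures have been shown to be powerful ansatzes for approximating the ground state of quantum systems. Here, we build on the results of [Phys. Rev. Research 2, 023358 (2020)] and construct a more powerful RNN wave function ansatz in two dimensions. We use symmetry and annealing to obtain accurate estimates of the ground state energies of the two-dimensional (2D) Heisenberg model on the square lattice and on the triangular lattice. We show that our method is superior to the Density Matrix Renormalization Group (DMRG) for system sizes larger than or equal to $14 \times 14$ on the triangular lattice.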
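
For orientation, the two objects named in this abstract can be written in their common textbook form. This is a sketch only: the coupling $J$, the sum over nearest-neighbor pairs $\langle i,j \rangle$, and the phase function $\phi_\theta$ are notational assumptions and may differ from the paper's own conventions.

```latex
% Heisenberg model on a lattice (assumed nearest-neighbor form)
H = J \sum_{\langle i,j \rangle} \mathbf{S}_i \cdot \mathbf{S}_j

% Autoregressive (RNN-style) wave function over spin configurations
% \sigma = (\sigma_1, \dots, \sigma_N): amplitude from a product of
% conditionals, plus an assumed phase \phi_\theta
P_\theta(\boldsymbol{\sigma}) = \prod_{n=1}^{N} P_\theta(\sigma_n \mid \sigma_1, \dots, \sigma_{n-1}),
\qquad
\Psi_\theta(\boldsymbol{\sigma}) = \sqrt{P_\theta(\boldsymbol{\sigma})}\; e^{i \phi_\theta(\boldsymbol{\sigma})}
```

Because $P_\theta$ factorizes into conditionals, configurations can be drawn one spin at a time without a Markov chain, which is what makes this family of ansatzes convenient for variational energy estimates.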

translated by Google Translate

Do Quantum Circuit Born Machines Generalize?

Kaitlin Gili , Mohamed Hibat-Allah , Marta Mauri , Chris Ballance , Alejandro Perdomo-Ortiz

Category: Machine Learning

2022-07-27

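In recent proposals of quantum circuit models for generative tasks, the discussion of their performance has been limited to their ability to reproduce a known target distribution. For example, expressive model families such as Quantum Circuit Born Machines (QCBMs) have been evaluated almost entirely on their capability to learn a given target distribution with high accuracy. While this aspect may be ideal for some tasks, it limits the scope of a generative model's assessment to its ability to memorize data rather than generalize. As a result, there has been little understanding of a model's generalization performance and of the relation between such capability and resource requirements, e.g., the circuit depth and the amount of training data. In this work, we leverage a recently proposed generalization evaluation framework to begin addressing this knowledge gap. We first investigate the QCBM's learning process on a cardinality-constrained distribution and see an increase in generalization performance as the circuit depth increases. In the 12-qubit example presented here, we observe that with as few as 30% of the valid patterns as the training set, the QCBM exhibits the best generalization performance toward generating unseen and valid patterns. Lastly, we assess the QCBM's ability to generalize not only to valid features, but also to high-quality bitstrings distributed according to an adequately biased distribution. We see that the QCBM is able to effectively learn the bias and generate unseen samples of higher quality than those in the training set. To the best of our knowledge, this is the first work in the literature that presents the QCBM's generalization performance as an integral evaluation metric for quantum generative models, and demonstrates the QCBM's ability to generalize to high-quality, desired novel samples.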
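
The distinction between memorizing and generalizing on a cardinality-constrained distribution can be illustrated with a small counting sketch. Everything here is an assumption for illustration: the sizes `n_bits = 12` and `k = 6`, the helper names, and the uniform random sampler that stands in for a trained model. It is not the evaluation framework used by the authors.

```python
import itertools
import random


def valid_bitstrings(n_bits, cardinality):
    """Enumerate all n-bit strings with exactly `cardinality` ones."""
    out = []
    for ones in itertools.combinations(range(n_bits), cardinality):
        bits = ["0"] * n_bits
        for i in ones:
            bits[i] = "1"
        out.append("".join(bits))
    return out


def classify_samples(samples, train_set, cardinality):
    """Count samples that are memorized, unseen but valid, or invalid."""
    counts = {"seen": 0, "unseen_valid": 0, "invalid": 0}
    for s in samples:
        if s in train_set:
            counts["seen"] += 1          # reproduced from the training set
        elif s.count("1") == cardinality:
            counts["unseen_valid"] += 1  # generalization: valid and new
        else:
            counts["invalid"] += 1       # violates the cardinality constraint
    return counts


rng = random.Random(0)
n_bits, k = 12, 6  # illustrative sizes, not taken from the paper
solution_space = valid_bitstrings(n_bits, k)
# Train on 30% of the valid patterns, mirroring the fraction in the abstract.
train_set = set(rng.sample(solution_space, int(0.3 * len(solution_space))))
# Hypothetical stand-in for model output: uniform random bitstrings.
samples = ["".join(rng.choice("01") for _ in range(n_bits)) for _ in range(1000)]
counts = classify_samples(samples, train_set, k)
print(counts)
```

A model that only memorizes would put nearly all of its samples in `seen`; the share landing in `unseen_valid` is the quantity a generalization metric would reward.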


Supplementing Recurrent Neural Networks with Annealing to Solve Optimization Problems

Shoummo Ahsan Khandoker , Jawaril Munshad Abedin , Mohamed Hibat-Allah

Category: Machine Learning

2022-07-17

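Combinatorial optimization problems can be solved by heuristic algorithms such as simulated annealing (SA), which aims to find the global minimum solution within a large search space through thermal fluctuations. The algorithm generates new solutions through Markov chain Monte Carlo techniques. The latter can result in severe limitations, such as slow convergence and a tendency to stay within the same local search space at small temperatures. To overcome these shortcomings, we use the variational classical annealing (VCA) framework, which combines autoregressive recurrent neural networks (RNNs) with traditional annealing to sample solutions independently of one another. In this paper, we demonstrate the potential of using VCA as an approach to solving real-world optimization problems. We explore VCA's performance in comparison with SA on three popular optimization problems: the maximum cut problem (Max-Cut), the nurse scheduling problem (NSP), and the traveling salesman problem (TSP). For all three problems, we find that VCA outperforms SA on average in the asymptotic limit. Interestingly, we reach large system sizes of up to $256$ cities for the TSP. We conclude that, in the best-case scenario, VCA can serve as a great alternative when SA fails to find the optimal solution.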
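
The baseline described here, simulated annealing driven by a Markov chain, can be sketched for Max-Cut as follows. This is a minimal illustration under stated assumptions: the geometric cooling schedule, the single-node flip move, and all parameter values are choices made for the example and are not taken from the paper. VCA itself, which replaces the Markov chain with an autoregressive RNN sampler, is not shown.

```python
import math
import random


def cut_size(edges, side):
    """Number of edges whose endpoints lie on opposite sides of the partition."""
    return sum(1 for u, v in edges if side[u] != side[v])


def simulated_annealing_maxcut(n_nodes, edges, n_steps=5000,
                               t_start=2.0, t_end=0.01, seed=0):
    """Metropolis-style annealing that maximizes the cut size."""
    rng = random.Random(seed)
    side = [rng.randint(0, 1) for _ in range(n_nodes)]
    current = cut_size(edges, side)
    best, best_side = current, side[:]
    for step in range(n_steps):
        # Geometric cooling from t_start down to t_end (an assumed schedule).
        t = t_start * (t_end / t_start) ** (step / max(1, n_steps - 1))
        node = rng.randrange(n_nodes)
        side[node] ^= 1                      # propose: move one node across
        proposed = cut_size(edges, side)
        delta = proposed - current
        # Always accept improvements; accept worse moves with Boltzmann weight.
        if delta >= 0 or rng.random() < math.exp(delta / t):
            current = proposed
            if current > best:
                best, best_side = current, side[:]
        else:
            side[node] ^= 1                  # reject: undo the flip
    return best, best_side


# Usage: a 4-cycle, whose nodes can be split so that every edge is cut.
square = [(0, 1), (1, 2), (2, 3), (3, 0)]
best, partition = simulated_annealing_maxcut(4, square)
print(best, partition)
```

Each proposal differs from the current state by a single flip, so consecutive samples are strongly correlated. At low temperature most downhill moves are rejected and the chain stays in one local region, which is the limitation the abstract attributes to Markov chain sampling.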


A Comprehensive Review on Autonomous Navigation

Saeid Nahavandi , Roohallah Alizadehsani , Darius Nahavandi , Shady Mohamed , Navid Mohajer , Mohammad Rokonuzzaman , Ibrahim Hossain

Category: Robotics

2022-12-24

The field of autonomous mobile robots has undergone dramatic advancements over the past decades. Despite achieving important milestones, several challenges are yet to be addressed. Aggregating the achievements of the robotic community as survey papers is vital to keep track of the current state of the art and the challenges that must be tackled in the future. This paper tries to provide a comprehensive review of autonomous mobile robots covering topics such as sensor types, mobile robot platforms, simulation tools, path planning and following, sensor fusion methods, obstacle avoidance, and SLAM. The motivation for presenting a survey paper is twofold. First, the autonomous navigation field evolves fast, so writing survey papers regularly is crucial to keep the research community well aware of the current status of this field. Second, deep learning methods have revolutionized many fields, including autonomous navigation. Therefore, it is necessary to give an appropriate treatment of the role of deep learning in autonomous navigation as well, which is covered in this paper. Future work and research gaps are also discussed.


GraphCast: Learning skillful medium-range global weather forecasting

Remi Lam , Alvaro Sanchez-Gonzalez , Matthew Willson , Peter Wirnsberger , Meire Fortunato , Alexander Pritzel , Suman Ravuri , Timo Ewalds , Ferran Alet , Zach Eaton-Rosen

Category: Machine Learning

2022-12-24

We introduce a machine-learning (ML)-based weather simulator--called "GraphCast"--which outperforms the most accurate deterministic operational medium-range weather forecasting system in the world, as well as all previous ML baselines. GraphCast is an autoregressive model, based on graph neural networks and a novel high-resolution multi-scale mesh representation, which we trained on historical weather data from the European Centre for Medium-Range Weather Forecasts (ECMWF)'s ERA5 reanalysis archive. It can make 10-day forecasts, at 6-hour time intervals, of five surface variables and six atmospheric variables, each at 37 vertical pressure levels, on a 0.25-degree latitude-longitude grid, which corresponds to roughly 25 x 25 kilometer resolution at the equator. Our results show GraphCast is more accurate than ECMWF's deterministic operational forecasting system, HRES, on 90.0% of the 2760 variable and lead time combinations we evaluated. GraphCast also outperforms the most accurate previous ML-based weather forecasting model on 99.2% of the 252 targets it reported. GraphCast can generate a 10-day forecast (35 gigabytes of data) in under 60 seconds on Cloud TPU v4 hardware. Unlike traditional forecasting methods, ML-based forecasting scales well with data: by training on bigger, higher quality, and more recent data, the skill of the forecasts can improve. Together these results represent a key step forward in complementing and improving weather modeling with ML, open new opportunities for fast, accurate forecasting, and help realize the promise of ML-based simulation in the physical sciences.
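
The figures in this abstract imply some simple bookkeeping, sketched below together with a generic autoregressive rollout loop. The `rollout` helper and its single-state `step_fn` interface are hypothetical simplifications for illustration; they are not GraphCast's actual API, and the real model may condition on more than one previous state.

```python
# Forecast horizon: 10 days at 6-hour intervals.
forecast_days = 10
step_hours = 6
n_steps = forecast_days * 24 // step_hours

# Variables per grid point: 5 surface variables plus 6 atmospheric
# variables at each of 37 pressure levels.
surface_vars = 5
atmospheric_vars = 6
pressure_levels = 37
channels = surface_vars + atmospheric_vars * pressure_levels


def rollout(step_fn, state, n_steps):
    """Autoregressive rollout: feed each prediction back in as the next input."""
    states = []
    for _ in range(n_steps):
        state = step_fn(state)
        states.append(state)
    return states


print(n_steps, channels)
# Toy usage with a stand-in "model" that just increments a counter.
print(rollout(lambda s: s + 1, 0, 3))
```

Because each step consumes the previous output, errors can compound over the rollout, which is why accuracy is reported per lead time rather than as a single number.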


AsyncFLEO: Asynchronous Federated Learning for LEO Satellite Constellations with High-Altitude Platforms

Mohamed Elmahallawy , Tie Luo

Category: Machine Learning

2022-12-22

Low Earth Orbit (LEO) constellations, each comprising a large number of satellites, have become a new source of big data "from the sky". Downloading such data to a ground station (GS) for big data analytics demands very high bandwidth and involves large propagation delays. Federated Learning (FL) offers a promising solution because it allows data to stay in-situ (never leaving satellites) and it only needs to transmit machine learning model parameters (trained on the satellites' data). However, the conventional, synchronous FL process can take several days to train a single FL model in the context of satellite communication (Satcom), due to a bottleneck caused by straggler satellites. In this paper, we propose an asynchronous FL framework for LEO constellations called AsyncFLEO to improve FL efficiency in Satcom. Not only does AsyncFLEO address the bottleneck (idle waiting) in synchronous FL, but it also solves the issue of model staleness caused by straggler satellites. AsyncFLEO utilizes high-altitude platforms (HAPs) positioned "in the sky" as parameter servers, and consists of three technical components: (1) a ring-of-stars communication topology, (2) a model propagation algorithm, and (3) a model aggregation algorithm with satellite grouping and staleness discounting. Our extensive evaluation with both IID and non-IID data shows that AsyncFLEO outperforms the state of the art by a large margin, cutting down convergence delay by 22 times and increasing accuracy by 40%.
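
The idea of staleness discounting in component (3) can be sketched as a weighted average in which older updates count for less. The abstract does not give the discount rule, so the polynomial form `1 / (1 + staleness) ** alpha`, the parameter `alpha`, and both function names are assumptions made for this illustration; satellite grouping is not modeled.

```python
def staleness_weight(staleness, alpha=0.5):
    """Assumed polynomial discount: fresh updates get weight 1, stale ones less."""
    return 1.0 / (1.0 + staleness) ** alpha


def aggregate(updates, current_round, alpha=0.5):
    """Average model parameter vectors, down-weighting stale contributions.

    `updates` is a list of (params, base_round) pairs, where `base_round` is
    the global round whose model the satellite started training from.
    """
    weights = [staleness_weight(current_round - base, alpha) for _, base in updates]
    total = sum(weights)
    n_params = len(updates[0][0])
    return [
        sum(w * params[i] for w, (params, _) in zip(weights, updates)) / total
        for i in range(n_params)
    ]


# Usage: one fresh update and one that is three rounds stale.
fresh = ([1.0, 2.0], 5)
stale = ([3.0, 4.0], 2)
print(aggregate([fresh, stale], current_round=5))
```

With equal staleness this reduces to a plain average, and as an update ages its influence on the aggregated model shrinks rather than being dropped outright.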


Hardware Acceleration of Lane Detection Algorithm: A GPU Versus FPGA Comparison

Mohamed Alshemi , Sherif Saif , Mohamed Taher

Category: Computer Vision

2022-12-19

A complete computer vision system can be divided into two main categories: detection and classification. The lane detection algorithm is part of the detection category and has been applied in autonomous driving and smart vehicle systems. The lane detection system is responsible for lane marking in a complex road environment. At the same time, lane detection plays a crucial role in the warning system that alerts when a car departs its lane. The implemented lane detection algorithm is mainly divided into two steps: edge detection and line detection. In this paper, we compare the state-of-the-art implementation performance obtained with both FPGA and GPU to evaluate the trade-off among latency, power consumption, and utilization. Our comparison emphasises the advantages and disadvantages of the two systems.
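
The two steps named in the abstract, edge detection followed by line detection, can be sketched in plain Python. The abstract does not say which operators the authors implemented, so the Sobel filter and the Hough transform used here are assumed stand-ins, and the function names, threshold, and synthetic image are illustrative only.

```python
import math


def sobel_edges(img, threshold):
    """Return (x, y) pixels whose Sobel gradient magnitude reaches the threshold."""
    h, w = len(img), len(img[0])
    edges = []
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Horizontal gradient: right column minus left column.
            gx = (img[y - 1][x + 1] + 2 * img[y][x + 1] + img[y + 1][x + 1]) \
               - (img[y - 1][x - 1] + 2 * img[y][x - 1] + img[y + 1][x - 1])
            # Vertical gradient: bottom row minus top row.
            gy = (img[y + 1][x - 1] + 2 * img[y + 1][x] + img[y + 1][x + 1]) \
               - (img[y - 1][x - 1] + 2 * img[y - 1][x] + img[y - 1][x + 1])
            if math.hypot(gx, gy) >= threshold:
                edges.append((x, y))
    return edges


def hough_strongest_line(edge_points, n_theta=180):
    """Vote in (rho, theta) space and return the most-voted line."""
    votes = {}
    for x, y in edge_points:
        for t in range(n_theta):
            theta = math.pi * t / n_theta
            rho = round(x * math.cos(theta) + y * math.sin(theta))
            votes[(rho, t)] = votes.get((rho, t), 0) + 1
    (rho, t), count = max(votes.items(), key=lambda kv: kv[1])
    return rho, math.degrees(math.pi * t / n_theta), count


# Synthetic 20x20 image with one bright vertical stripe at x = 10.
image = [[1.0 if x == 10 else 0.0 for x in range(20)] for y in range(20)]
edges = sobel_edges(image, threshold=2.0)
rho, theta_deg, count = hough_strongest_line(edges)
print(len(edges), rho, theta_deg, count)
```

Both stages are dominated by independent per-pixel work (a 3x3 stencil, then one vote per edge pixel and angle), which is the kind of regular parallelism that makes GPU and FPGA implementations worth comparing.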


Multimodal CNN Networks for Brain Tumor Segmentation in MRI: A BraTS 2022 Challenge Solution

Ramy A. Zeineldin , Mohamed E. Karar , Oliver Burgert , Franziska Mathis-Ullrich

Category: Computer Vision | Machine Learning

2022-12-19

Automatic segmentation is essential for the brain tumor diagnosis, disease prognosis, and follow-up therapy of patients with gliomas. Still, accurate detection of gliomas and their sub-regions in multimodal MRI is very challenging due to the variety of scanners and imaging protocols. Over the last years, the BraTS Challenge has provided a large number of multi-institutional MRI scans as a benchmark for glioma segmentation algorithms. This paper describes our contribution to the BraTS 2022 Continuous Evaluation challenge. We propose a new ensemble of multiple deep learning frameworks, namely DeepSeg, nnU-Net, and DeepSCAN, for automatic detection of glioma boundaries in pre-operative MRI. It is worth noting that our ensemble models took first place in the final evaluation on the BraTS testing dataset with Dice scores of 0.9294, 0.8788, and 0.8803, and Hausdorff distance of 5.23, 13.54, and 12.05, for the whole tumor, tumor core, and enhancing tumor, respectively. Furthermore, the proposed ensemble method ranked first in the final ranking on another unseen test dataset, namely the Sub-Saharan Africa dataset, achieving mean Dice scores of 0.9737, 0.9593, and 0.9022, and HD95 of 2.66, 1.72, and 3.32 for the whole tumor, tumor core, and enhancing tumor, respectively. The docker image for the winning submission is publicly available at (https://hub.docker.com/r/razeineldin/camed22).
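
The Dice scores quoted above measure overlap between a predicted mask and a reference mask. A minimal sketch on flattened binary masks follows; the function name and the convention of returning 1.0 when both masks are empty are assumptions for this example, and challenge tooling may handle that edge case differently.

```python
def dice_score(pred, target):
    """Dice coefficient 2|A ∩ B| / (|A| + |B|) for two flat binary masks."""
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    if total == 0:
        return 1.0  # assumed convention: two empty masks agree perfectly
    return 2.0 * intersection / total


# Usage: the prediction marks two voxels, the reference marks one of them.
prediction = [1, 1, 0, 0]
reference = [1, 0, 0, 0]
print(dice_score(prediction, reference))
```

A score of 1 means the masks coincide and 0 means they do not overlap at all, so the three reported values can be read as overlap quality for each tumor sub-region.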


Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

Category: Computer Vision | Machine Learning

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% of challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.


Towards mapping the contemporary art world with ArtLM: an art-specific NLP model

Qinkai Chen , Mohamed El-Mennaoui , Antoine Fosset , Amine Rebei , Haoyang Cao , Christy Eóin O'Beirne , Sasha Shevchenko , Mathieu Rosenbaum

Category: Natural Language Processing | Machine Learning

2022-12-14

With an increasing amount of data in the art world, discovering artists and artworks suitable to collectors' tastes becomes a challenge. It is no longer enough to use visual information, as contextual information about the artist has become just as important in contemporary art. In this work, we present a generic Natural Language Processing framework (called ArtLM) to discover the connections among contemporary artists based on their biographies. In this approach, we first continue to pre-train the existing general English language models with a large amount of unlabelled art-related data. We then fine-tune this new pre-trained model with our biography pair dataset manually annotated by a team of professionals in the art industry. With extensive experiments, we demonstrate that our ArtLM achieves 85.6% accuracy and 84.0% F1 score and outperforms other baseline models. We also provide a visualisation and a qualitative analysis of the artist network built from ArtLM's outputs.
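
The accuracy and F1 figures reported here can both be derived from a confusion matrix. The sketch below assumes a binary task (for example, whether a pair of biographies is connected or not), which the abstract does not state explicitly; the function name and the counts in the usage example are made up for illustration.

```python
def accuracy_and_f1(tp, fp, fn, tn):
    """Accuracy and F1 from binary confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return accuracy, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, f1


# Usage with invented counts: 8 true positives, 2 false positives,
# 2 false negatives, 8 true negatives.
acc, f1 = accuracy_and_f1(tp=8, fp=2, fn=2, tn=8)
print(acc, f1)
```

Accuracy counts all correct decisions, while F1 ignores true negatives and balances precision against recall, so the two can diverge when the classes are imbalanced.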
